[EQUEiNCE LISTING 



tb41 (MTCC#2) 



INFORMATION FOR SSQ ID NO\:140: 

(i) SEQUENCE CHARACTERISTICS: 



(A) 
(3) 
(C) 
(D) 



LENGTH j 1441 has pairs 
TYPE: ^nucleic aci<^ 
STRANDEDNESS : single 
TOPOLOGY : linear 



(ii) MOLECULE TYPE : cDNA 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 0 



GAGGTTGCTG 
TTCCGGTCCG 
GGAGTTGACT 
GTGGATGGGG 
GGCCGCCACG 
GTTTGGGACG 
GTTGATGTCG 
CCAGGCCGAG 
GGCATCTGCG 
GGCCGGGCCC 
TGCACAGGCG 
GGCCGCCAAC 
GCAAGTCGGA 
CGCGCTCTAC 
CAGACCCTGG 
TCCACTGAGT 
GCCGGTGTCC 
CTGG AC CACG 
CAGCGCCGGC 
GGCTTTGGCG 
CAGCACTGAC 
GCCGCCCGGA 
CCTGGTGAGC 
GACGACGGTG 
G 



GCAATGGATT 
GGGCCGGAGT 
TCCGCCGCGG 
CCGGCGGCGG 
GCGGCGCTGG 
GCGTTCGCGA 
CTGGTCGCGG 
TATGCCGAAA 
GCCGCGTCGG 
GCGGC CGCAG 
ACACTGGCCC 
GCTGATCCGC 
TCCGCTCAGC 
ATTGCATCCA 
CACATCGGCC 
TCGGCGACCG 
GCGGGCGTCG 
GCCGCCCCGG 
GCCGACCCGA 
AGCCTGGCCG 
GGCCAAGAGG 
AACCCCCCGC 
GTGGCTATCC 
TCTGGATTCT 



TCGGGCTT TT 
CGATGCTA< SC 
TCTCGTATGG 
CCGCGATGGC 
CGAAGGAGAC 
TGACGGTG 1 
CGAACATTCl 
TGTGGGCC 
CGTTGCCGO 
CCGCGGCGA 
AGCTG'CCCO 
TGACATCGO 
CGATAGTGATl 
TCGCGACCGG 
TATACGGGAA 
ACGAGCCGGA 
GCCACGCAGC 
AGATC CAGCT 
CGGCCCTAAA 
CACGCGGCAC 
ACGGC CGCAA 
GGTAAAAGTC 
GACGGGCCGT 
CGGCGGCTAT 



ACCTCCGGAA 
CGCCGCGGCC 
ATCGGTGGTG 
GGCCGCGGCA 
GGCCACACAG 
ACCATCCCTC 
GGGGCAAAAC 
AGACGCTGCC 
GTTCACTCCA 
CCAAGCCGCC 
GGGGATCCTG 
ACTGTTGGGG 
CCCCACCCCG 
CAGCATTGCG 
iCGCCGGCGGG 

[gccgcactgg 

lTTAGTCGGA 

:gccgttcag 
:gggatgccg 
(acgggcggt 
;ccccggta 
cggcaaccgt 
tcacaccgct 
cagagcgatt 



GTGAATTCAA 
GCCTGGGACG 
TCGACGCTGA 
ACGCCGTATG 
GCGAGGGCAG 
GTCGCGGCCA 
AGTGCGGCGA 
GTGATGTACA 
CCCGTGCAAG 
GGTGCGGGCG 
AGCGACATTC 
ATCGCGTCGA 
ATAGGGGAAT 
CTCGCGATCA 
CTGGGACCGA 
GGCCCCTTCG 
GCGTTGTCGG 
GCAACACCCA 
GCAGGCCTGC 
GGCGGCACCC 
GTTGTGATTA 
TCGTCGCCGC 
TGTAGTAGCG 
TTGCTCGCAA 



GCCGAATGTA 
GTGTGGCCGC 
TCGTTGAGCC 
TGGGGTGGCT 
CGGCGGAAGC 
ACCGCAGCCG 
TCGCGGCTAC 
GCTATGAGGG 
GCACCGGCCC 
CCGTTGCGGA 
TGTCCGCATT 
CCCTCAACCC 
TGGACGTGAT 
CGAACACGGC 
CGCAGGGCCA 
GGGGCGCGGC 
TGC CGCACAG 
CCTTCAGCTC 
TCAGCGGGAT 
GTAGCGGCAC 
GAGAGCAGCC 
GCGGAAAATG 
TACGGCTATG 
CCTCAGCAAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1441 



(2) INFORMATION FOR SEQ ID NO: 142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 423 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TO POLOGY : 1 ine ar 

(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID\NO:142: 



Met Asp Phe Gly Leu Leu Pro Pro Glu Val Wsn Ser Ser Arg Met Tyr 
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\ 



Ser Gly Pro Gly Pro Glu Ser Met Leu Ala Ala Ala Ala Ala Trp Asp 



20 25 30 

Gly Val Ala Ala Glu Leu Thr Ser Ala Ala Val Ser Tyr Gly Ser Val 

35 40 45 

Val Ser Thr Leu lie Val Glu Pro Trp Met Gly Pro Ala Ala Ala Ala 

50 55 60 

Met Ala Ala Ala Ala Thr Pro Tyr Val Gly Trp Leu Ala Ala Thr Ala 
65 70 75 30 

Ala Leu Ala Lys Glu Thr Ala Thr Gin Ala Arg Ala Ala Ala Glu Ala 

35 90 95 

Phe Gly Thr Ala Phe Ala Met Thr Val Pro Pro Ser Leu Val Ala Ala 

100 105 110 

Asn Arg Ser Arg Leu Met Ser Leu Val Ala Ala Asn lie Leu Gly Gin 

115 120 125 

Asn Ser Ala Ala lie Ala Ala Thr Gin Ala Glu Tyr Ala Glu Met Trp 

130 135 140 

Ala Gin Asp Ala Ala Val Met Tyr Ser Tyr Glu Gly Ala Ser Ala Ala 
145 150 155 160 

Ala Ser Ala Leu Pro Pro Phe Thr Pro Pro Val Gin Gly Thr Gly Pro 

165 170 175 

Ala Gly Pro Ala Ala Ala Ala Ala Ala Thr Gin Ala Ala Gly Ala Gly 

180 ' 135 190 

Ala Val Ala Asp Ala Gin Ala Thr Leu Ala Gin Leu Pro Pro Gly lie 

195 200 205 

Leu Ser Asp lie Leu Ser Ala Leu Ala Ala Asn Ala Asp Pro Leu Thr 

210 215 220 

Ser Gly Leu Leu Gly lie Ala Ser Thr Leu Asn Pro Gin Val Gly Ser 
225 230 235 240 

Ala Gin Pro lie Val lie Pro Thr Pro lie Gly Glu Leu Asp Val lie 

245 250 255 

Ala Leu Tyr lie Ala Ser lie Ala Thr Gly Ser lie Ala Leu Ala lie 

260 265 270 

Thr Asn Thr Ala Arg Pro Trp His lie Gly Leu Tyr Gly Asn Ala Gly 

275 280 285 

Gly Leu Gly Pro Thr Gin Gly His Pro Leu Ser Ser Ala Thr Asp Glu 

290 295 300 

Pro Glu Pro His Trp Gly Pro Phe Gly Gly Ala Ala Pro Val Ser Ala 
305 310 315 320 

Gly Val Gly His Ala Ala Leu Val Gly Ala Leu Ser Val Pro His Ser 

325 330 335 

Trp Thr Thr Ala Ala Pro Glu lie Gin Leu Ala Val Gin Ala Thr Pro 

340 345 350 

Thr Phe Ser Ser Ser Ala Gly Ala Asp Pro Thr Ala Leu Asn Gly Met 

355 360 365 

Pro Ala Gly Leu Leu Ser Gly Met Ala Leu Ala Ser Leu Ala Ala Arg 

370 375 380 

Gly Thr Thr Gly Gly Gly Gly Thr Arg Ser Gly Thr Ser Thr Asp Gly 
385 390 395 400 

Gin Glu Asp Gly Arg Lys Pro Pro Val Val Val lie Arg Glu Gin Pro 

405 410 415 

Pro Pro Gly Asn Pro Pro Arg 
420 



Mtb40 (HTCC#1) 

(2) INFORMATION FOR SEQ ID NO: 13 7: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1200 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 7 : 



SO 
120 
180 
240 
300 
3S0 
420 
480 
540 
S00 
S60 
720 
780 
840 
900 
960 
1020 



?t^ ATGAG CAGAGCGTTC ATCATCGATC CAACGATCAG TGCCATTr- i n „„ 
ACCTTCTGGG GATTGGAATA CCCAACCAAG GGGGTATCCT SaSSS^ GGCTTGTAGG 
TCGAAAAAGC CCTGGAGGAG CTGGCAGCAG CGTTTCCGGG Irz^ ^ CTAGAGTA CT 
CCGCGGACAA ATACGCCGGC AAAAACCGCA I™* 3 ™** 
ACCTCGATCG TCAGCTCATC AGCCTGATCC ACGACCAcSc" 

GCGACATCCT GGAGGGCGCC AAGAAAGGTC TCGAGTTCGT GC^SSSJ ^CGACCC 
TGACCTACAT CCCGGTCGTC GGGCACGCCC TATCGGCCgJ StSgS ^J 0106 ** 
CGGGCGCGAT GGCCGTAGTG GGCGGCGCGC TTGCCTACCT Stc^S^ CCGTTTTGCG 
ACGCGACTCA ACTCCTCAAA TTGCTTGCCA AATTGgJcS ACGCTGATCA 
CGGACATCAT TTCGGATGTG GCGGACATCA TCAAGgSSc GCCGCCATTG 

tcatcacaaa cgcgctcaac ggcctgaaag agctttcSa gtg tgggagt 

CCGGACTGTT CTCTCGAGGG TGGTCGAACC TGGAGTCcS SJ52S£ ^° TGGGTGA 

tgaccggcgc gaccagcggc ttgtcgcaag tgactggcS StccSSS gtccccggct 
ccgcatcgtc gggcttggct cacgcggata gcctggcSS SSSS gc cggtctgt 
tggccggcat tgggggcggg tccggttttg ggggcttgJc SSSSSS ttgcccgccc 

CCGCCTCAAC TCGGCAGGCG CTACGGCCCC GAGcSSS CCCGO^Scr S* 3 ™*™ 

AGCAGGTCGG CGGGCAGTCG cagctggtct ccgcgcaggg ScS^ gccgctgccg 
ccgtaggcat gggcggcatg cacccctctt cggcJgcgt? gIS2? GG J atgggcggac 

AGTACTCGGA AGGCGCGGCG GCGGGCACTG AAGACGCcS SJJSS ACGACGAA GA 1080 

acgcggccgc tgggcaaaag gtcctggtac oJSSSJ SSSS g~Sa G li:° 0 

(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 92 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 8: 
Met Ser Arg Ala Phe He He Asp Pro Thr He Ser Ala He Asp Gly 
Ty, Asp Leu Leu Gly He Gly lie Pro Asn Gin Gly oly £ Leu 
Tyr ser Ser Leu Glu Tyr Phe Glu Lys Ala Leu Glu Glu IL Ala Ala 

Ala Phe Pro Gly Asp Gly Trp tl Gly Ser Ala Ala Asp Lys Tyr Ala 

SO 

Gly Lys Asn Arg Asa His Val Asn Phe Phe Gin Glu Leu Ala Asp" Leu 
Asp Arg Gin Leu lie Ser Leu He His Asp Gin Ala Asn Ala Val gL 
Thr Thr Arg Asp He Leu Glu Gly Ala Lys Lys Gly Leu Glu Phe Val 
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Arg Pro Val Ma Val Asp Leu Thr Tyr Ha Pro Val Val Gly His Ala 

Leu Ser Ala Ala Phe Gin Ala Pro Phe Cys Ala Gly HI Met ^ Val 

Val Gly Gly Ala Leu Ala Tyr Leu Val Val Lys Thr Leu lie Asn Ala 

Thr Gin Leu Leu Lys Leu Leu Ala Lys Leu Ala Glu Leu Val Ala Ma 

Ala lie Ala Asp He lie Ser Asp Val Ala Asp lie He Lys Gly Thr 

Leu Gly Glu Val Trp Glu Phe He J" Asn Ala Leu Asn Gly Leu Lvs 

195 2 °0 205 

Glu Leu Trp Asp Lys Leu Thr Gly Trp Val Thr Gly Leu Phe Ser Ara 

Gly Trp Ser Asn Leu Glu Ser Phe Phe Ala Gly Val Pro Gly Leu Thr 

Gly Ala Thr Ser Gly Leu Ser Gin Val Thr Gly Leu Phe Gly Ala Ma 

Gly Leu Ser Ma Ser Ser Gly Leu Ala His Ala Asp Ser Leu III Ser 

Ser Ala Ser Leu Pro Ala Leu Ma Gly He Gly Gly Gly Ser aiy Phe 

Gly Gly Leu Pro Ser Leu Ma Gin Val His Ala Ala Ser Thr Arg Gin 

Ma Leu Arg Pro Arg Ma Asp Gly Pro Val Gly Ala Ala Ala Glu Gin 

Val Gly Gly Gin Ser Gin Leu Val Ser Ala G^n Gly Ser Gin Gly Met 

Gly Gly Pro Val Gly Met Gly Gly Met His Pro Ser Ser Gly HI S er 

Lys Gly Thr Thr Thr Lys Lys Tyr Ser Glu Gly Ala Ala Ma Gly Thr 

360 3S5 1 

Glu Asp Ma Glu Arg Ala Pro Val Glu Ala Asp Ala Gly Gly Gly Gin 

; 375 33o 

Lys Val Leu Val Arg Asn Val Val 
385 390 



Mtb9.9A (MTI-A) 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1742 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(xi). SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 

CCGCTCTCTT TCAACGTCAT AAGTTCGGTG GGCCAGTCGG CCGCGCGTGC ATATGGCACC 
AATAACGCGT GTCCCATGGA TACCCGGACC GCACGACGGT AGAGCGGATC 120 
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CTACCGCGTC CACGCTCAGC CCTGCCGCGT TGCGGAAGAT CG&Gcrrw 

TTCTCATGGT CGTTAACGCC TTCCAACACT GCGACGGTGC GCGCCCCGGC gSp^ 13 ° 

GCAACGCTCG GCTCCGGCAC CCGGCGCGCG GCTGCCAACA CCCCmSS 240 

CCGATCACCC GTGCCATGAC ATCAGCCGAC GCTCGATAGT SS££5£ 3 °° 

AGATCATCCT TGAGCTCGGC CAGCCGCCGG TCGGTGCCGA ACAgScSg SSS?*** 3 ?° 

cgtgaggcca gcatgcgctg caccaccagc acaccctcgg cgmSSJ r^S 5 ^ 0 420 

gtcggcagat cgggacnacn gtcgatgctg ttcaggtcac SaISccS? Sr?^ GCCG 480 

tcgtcgggat cgcagacgtc ctgaacatcg aggccgtcgg gSJSSS 2 GCCGTGGG 540 

TCGGTCACGG GCTTTCGTCG ACCAGAGCCA GCATCAGATC GgSSSsg JSSSSf 

CACGCTCGCT GCGGTTCAGC GTCGCGAGCC GCTCAGCCAG CCACtSS SSS*^ S5 ° 
TGCTGGGATT AATTGGGAGA GGAAGACAGC ATGTCGTTCG SaISScA SSSS" 
CTGGCAGCTG CGGCGGCGAA CCTACAGGGT ATTGGCACGA cStSSS cSS^ 

GCCGCGGCTG CTCCAACCAC CGGAGTAGTG CCCGCAGCCG CCGAtcJ££ f2S£S ° 84 ° 

ACCGCGGCTC AGTTTGCTGC GCACGCGCAG ATGTACCAAA cSJSSSS SSS? 0 ™ 9 °° 

GCCATTCACG AAATGTTCGT GAACACGCTG GTGGCCAGTT ctSSSS SSS?** 30 9S ° 

GAGGCGGCCA ACGCAGCCGC TGCCGGCTGA ACGGGCTCGC aSSSSgc SSS""* 102 ° 

GGGGAAC AT C CGGAGTTCTC GGGTCAGGGG TTGCGCCAGC oSSJSS ESiSS™ 0 108 ° 

GGCGTCCATA acagcagacg atctaggcat tcagtactaa gSSJSJ ttcagntatc • 1140 

CACGTTTTAT GACGGATCCG CATGCGATgJ SaStSS ££££££ 1200 

CCCAGACGGT GGAGGACGAG GCTCGCCGGA TGTGGGCGTC CGCgSHS CG 12S ° 

cgggctggag tggcatggcc gaggcgacct cgctagaSc StgSc^S JJJIS 00 ™ 1320 

CGTTTCGCAA CATCGTGAAC ATGCTGCACG GGGTGCGTGA SSSS JJSJ^ 133 ° 

ACAANTACGA ACAGCAAGAG CAGGCCTCCC AGCAGATCCT crrrr** " 4 ° 

CACAGCTGNG TACGNTTTCT CACATTAGGA GAACACCAAT ATgSSSa SSS*** 1S °° 

cggggacgtc gacgctcatg gcgccatgat ccgcgctcag g"cc5St~cS ™ CCAGTT 1560 

GCATCAGGCC ATCGTTCGTG ATGTGTTGGC COCOGqJSS SSSSg SSSS^ 162 ° 

ggtggcttgc caggagtxca ttacccagtt gggccgta.c tSSSS ?c?acSgca C iJ£ 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 83S base pairs 

(B) TYPE: nucleic acid 

(C) STRAND ED NESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

GTTGATTCCG TTCGCGGCGC CGCCGAAGAC CACCAACTCC GCTGGGGTGG rrrrmnnr. 
£S5££ GTCAGCTGGC CGAATCCCAA TGATTGGTGG ctSSScS ScS^ 
^™f CCC CACGGAAAGG ACGACGATCG TTCGTTTGCT CGGTCAGTCG tISSS 
C^rZZ GCGGTTTCTT ^CTCGATCG CACAGCAGCT GACCTTcSc cJSScSS 
CAACGGCTGG CTCCGGCGGA GCCTGGTACC CAACGCCACA ATTCGCCGGC CTgSSSS 
r^r~ T GTCGGCGAGT raC-CGCGGG CGGAGCCGGT CGGG^SSg £££££ 
CAAGTTGGGC CGTCGCGGCT CCGGCCTTCG CGGAGAAGCC TGAGGCGGGC AcSrrn^ 
CGAAGCGTCC AGCTGCGGTC AGGGAGGCCT GCT^SSc ISSSS 
CGAGAGCGGG GCGGCGTACA GGCGCCTTCG CTCACCGATA CGGGTTCCGC 
TTACCCGGTC TCCGTCGGCG GGATAGCTTT CGATCCGGTC TGCGCGGCCG SSS^ 
TGCAGATAGC GATCGACCGC GCCGGTCGGT AAACGCCGCA CACcSScTA SSS£2f 
CGGCGGGCGT TGATGCCAAA TTGACCGTCC CGACGGGGCT TTATCTGCGG 
TCCCCAGCCC GGTCGGTGGG CCGATAAATA CGCTGGTCAG ScgStcS CC^SgIS 



60 
12 0 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
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TCGATGCTCT GGGCGCCCGC TCGACGCCGA GTATCTCGAG TGGGCCGCAA ACCCGGTCAA 
ACGCTGTTAC TGTGGCGTTA CCACAGGTGA ATTTGCGGTG CCAACTGGTG AACACTTGCG 
AACGGGTGGC ATCGAAATCA ACTTGTTGCG TTGCAGTGAT CTACTCTCTT GCAGAGAGCC 
GTTGCTGGGA TTAATTGGGA GAGGAAGACA GCATGTCGTT CGTGACCACA C^GCCGGAAG 
CCCTGGCAGC TGCGGCGGCG AACCTACAGG GTATTGGCAC GACAATGAAC GCCCAGAACr 
CGGCCGCGGC TGCTCCAACC ACCGGAGTAG TGCCCGCAGC CGCCGATGAA GTATCAGCGC 
TGACCGCGGC TCAGTTTGCT GCGCACGCGC AGATGTACCA AACGGTCAGC GCCCAGGCCG 
CGGCCATTCA CGAAATGTTC GTGAACACGC TGGTGGCCAG TTCTGGCTCA TACGCGGCCA 
CCGAGGCGGC CAACGCAGCC GCTGCCGGCT GAACGGGCTC GCACGAACCT GCTGAAGGAG 
AGGGGGAACA TC CGGAGTTC TCGGGTCAGG GGTTGCGCCA GCGCCCAGCC GATTCAGCTA 
TCGGCGTCCA TAACAGCAGA CGATCTAGGC ATTCAGTACT AAGGAGACAG GCAACATGGC 
CTCACGTTTT ATGACGGATC CGCATGCGAT GCGGGACATG GCGGGCCGTT TTGAGGTGCA 
CGCCCAGACG GTGGAGGACG AGGCTCGCCG GATGTGGGCG TCCGCGCAAA ACATTTCCGG 
TGCGGGCTGG AGTGGCATGG CCGAGGCGAC CTCGCTAGAC ACCATGACCT AGATGAATCA 1620 
GGCGTTTCGC AACATCGTGA ACATGCTGCA CGGGGTGCGT GACGGGCTGG TTCGCGACGC 1630 
CAACAACTAC GAACAGCAAG AGCAGGCCTC CCAGCAGATC CTGAGCAGCT AGCGCCGAAA 1740 
GCCACAGCTG CGTACGCTTT CTCACATTAG GAGAACACCA ATATGACGAT TAATTACCAG 19 00 

TTCGGGGACG TCGACGCTCA TGGCGCCATG ATCCGCGCTC AGGCGGCGTC GCTTGAGGCG 13 SO 
M GAGCATCAGG CCATCGTTCG TGATGTGTTG GCCGCGGGTG ACTTTTGGGG CGGCGCCGGT 192 0 

^ TCGGTGGCTT GCCAGGAGTT CATTACCCAG TTGGGCCGTA ACTTCCAGGT GATCTACGAG 1980 

MS CAGGCCAACG CCCACGGGCA GAAGGTGCAG GCTGCCGGCA ACAACATGGC GCAAACCGAC 

m ' AGCGCCGTCG GCTCCAGCTG GGCCTAAAAC TGAACTTCAG TCGCGGCAGC ACACCAACCA 
ty GCCGGTGTGC TGCTGTGTCC TGCAGTTAAC TAGCACTCGA CCGCTGAGGT AGCGATGGAT , ibU 

m CAACAGAGTA CCCGCACCGA CATCACCGTC AACGTCGACG GCTTCTGGAT GCTTCAGGCG 222 0 

Si CTACTGGATA TCCGCCACGT TGCGCCTGAG TTACGTTGCC GGCCGTACGT CTCCACCGAT 22 80 

iy TCCAATGACT GGCTAAACGA GCACCCGGGG ATGGCGGTCA TGCGCGAGCA GGG CATTGTC 2340 

GTCAACGACG CGGTCAACGA ACAGGTCGCT GCCCGGATGA AGGTGCTTGC CGCACCTGAT 2400 
^ CTTGAAGTCG TCGCCCTGCT GTCACGCGGC AAGTTGCTGT ACGGGGTCAT AGACGACGAG 24 6 0 

3 ^ AACCAGCCGC CGGGTTCGCG TGACATCCCT GACAATGAGT TC CGGGTGGT GTTGGCCCGG 252 0 

^ CGAGGC CAGC ACTGGGTGTC GGCGGTACGG GTTGGCAATG ACATCACCGT CGATGACGTG 2 58 0 

g ACGGTCTCGG ATAGCGCCTC GATCGCCGCA CTGGTAATGG ACGGTCTGGA GTCGATTCAC 

3 CACGCCGACC CAGCCGCGAT. CAACGCGGTC AACGTGCCAA TGGAGGAGAT CTCGTGCCGA 

3 ATTCGGCACG AGGCACGAGG CGGTGTCGGT GACGACGGGA TCGATCACGA TCATCGACCG , /ou 

GCCGGGATCC TTGGCGATCT CGTTGAGCAC GACCCGGGCC CGCGGGAAGC TCTGCGACAT 2fl^n 
CCATGGGTTC TTCCCG 



840 
900 
950 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



2040 
2100 
2160 



2640 
2700 
2760 



2836 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) . MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 

Met Thr He Asn Tyr Gin Phe Gly Asp Val Asp Ala His Gly Ala Met 

I 5 10 15 

He Arg Ala Leu Ala Gly Leu Leu Glu Ala Glu His Gin Ala He He 

20 25 30 

Ser Asp Val Leu Thr Ala Ser Asp Phe Trp Gly Gly Ala Gly Ser Ala 
35 40 45 
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Ala Cys Gin Gly Phe 
SO 

Tyr Glu Gin Ala Asn 
65 

Asn Met Ala Gin Thr 
85 



lie Thr Gin Leu Gly Arg 
55 

Ala His Gly Gin Lys Val 
70 75 
Asp Ser Ala Val Gly Ser 
90 



Asn Phe Gin Val He 
60 

Gin Ala Ala Gly Asn 
80 

Ser Trp Ala 



Mtb9.9A (MTI-A) ORF peptides 

(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 1 : 

Met Thr He Asn Tyr Gin Phe Gly Asp Val Asp Ala His Gly Ala 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 

Gin Phe Gly Asp Val Asp Ala His Gly Ala Met lie Arg Ala Gin 
15 1° 15 

(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: peptide 



(vi) ORIGINAL SOURCE: 



(A) ORGANISM: Mycobacterium tuberculosis 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 

Asp Ala His Gly Ala Met lie Arg Ala Gin Ala Ala Sar Leu Glu 
1 5 10 IS 

(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Mycobacterium tuberculosis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Met lie Arg Ala Gin Ala Ala Ser Leu Glu Ala Glu His Gin Ala 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:55: 

Ala Ala Ser Leu Glu Ala Glu His Gin Ala lie Val Arg Asp Val 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 56: 
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Ala Glu Kis Gin Ala lie Val Arg Asp Val Leu Ala Ala Gly Asp 
1 5 10 is* 



(2) INFORMATION FOR SHQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(xi). SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

lie Val Arg Asp Val Leu Ala Ala Gly Asp Phe Trp Gly Gly Ala 

1 - 5 .10 is , 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

- (O) STRANDEDNESS-: single -- 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

Leu Ala Ala Gly Asp Phe Trp Gly Gly Ala Gly Ser Val Ala Cys Gin 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Phe Trp Gly Gly Ala Gly Ser Val Ala Cys Gin Glu Phe lie Thr 
1*5 10 15 

(2) INFORMATION FOR SEQ ID NO: 60: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 15 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO.-SO: 

Gly Ser Val Ala Cys Gin Glu Phe lie Thr Gin Leu Gly Arg Asn 

10 IS 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: " 

(A) ORGANISM : Mycobacterium tuberculosis 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 

Gin Glu Phe lie Thr Gin Leu Gly Arg Asn Phe Gin Val lie Tyr Glu 
Gin Ala 10 15 

(2) INFORMATION FOR SEQ ID NO: 62: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 62 : 

Arg Asn Phe Gin Val lie Tyr Glu Gin Ala Asn Ala His Gly Gin 
5 10 15 

2) INFORMATION FOR SEQ ID NO: 63: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 
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(3) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE : peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : S3 : 



lie Tyr Glu Gin Ala Asn Ala His Gly Gin Lys Val Gin Ala Ala 
15 10 1S 

(2) INFORMATION FOR SEQ ID NO : 64 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 64 : 



Asn Ala His Gly Gin Lys Val Gin Ala Ala Gly Asn Asn Met Ala 
1 5 ' 10 15 

(2) INFORMATION FOR SEQ ID NO : 65 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 65 : 



Lys Val Gin Ala Ala Gly Asn Asn Met Ala Gin Thr Asp Ser Ala 
1 5 10 is 

(2) INFORMATION FOR SEQ ID NO: 66: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



il 



(ii) MOLECULE TYPE: peptide 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Gly Asn Asn Met Ala Gin Thr Asp Ser Ala Val Gly Ser Ser Trp Ala 
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15 



Mtb9.8 (MSL) 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 85 base pairs 
Q (B) TYPE: nucleic acid 

jB - (C) STRAND EDNESS : double 

Jjn (D) TOPOLOGY: linear 

gl (ii) MOLECULE TYPE : DNA (genomic) 

Cj (vi) ORIGINAL SOURCE: 

j=|J (A) ORGANISM: Mycobacterium tuberculosis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

TGGATTC CGA TAGCGGTTTC GGCCCCTCGA CGGGCGACCA CGGCGCGCAG GCCTCCGAAC 60 

GGGGGGCCGG GACGCTGGGA TTCGCCGGGA CCGCAACCAA AGAACGCCGG GTCCGGGCGG 12 0 

TCGGGCTGAC CGCACTGGCC GGTGATGAGT TCGGCAACGG CCCCCGGATG CCGATGGTGC 18 0 

CGGGGACCTG GGAGCAGGGC AGCAACGAGC CCGAGGCGCC CGACGGATCG GGGAGAGGGG 24 0 

GAGGCGACGG CTTACCGCAC GACAGCAAGT AACCGAATTC CGAATCACGT GGACCCGTAC 3 00 

GGGTCGAAAG GAGAGATGTT ATGAGCCTTT TGGATGCTCA TATCCCACAG TTGGTGGCCT 3 60 

CCCAGTCGGC GTTTGCCGCC AAGGCGGGGC TGATGCGGCA CACGATCGGT CAGGCCGAGC 420 

AGGCGGCGAT GTCGGCTCAG GCGTTTCACC AGGGGGAGTC GTCGGCGGCG TTTCAGGCCG 480 

CCCATGCCCG GTTTGTGGCG GCGGCCGCCA AAGTCAACAC CTTGTTGGAT GTCGCGCAGG 54 0 

CGAATCTGGG TGAGGCCGCC GGTACCTATG TGGCCGCCGA TGCTG 585 



(2) INFORMATION FOR SEQ ID NO: 10 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 10 9: 

Met Ser Leu Leu Asp Ala His lie Pro Gin Leu Val Ala Ser Gin Ser 

15 10 15 

Ala Phe Ala Ala Lys Ala Gly Leu Met Arg His Thr lie Gly Gin Ala 

20 25 30 

Glu Gin Ala Ala Met Ser Ala Gin Ala Phe His Gin Gly Glu Ser Ser 
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35 

Ala Ala Phe Gin Ala 
SO 

Val Asn Thr Leu Leu 
65 

Gly Thr Tyr Val Ala 
85 

Phe 



40 

Ala His Ala Arg Phe Val 
55 

Asp Val Ala Gin Ala Asn 
70 75 
Ala Asp Ala Ala Ala Ala 
90 



45 

Ala Ala Ala Ala Lys 
SO 

Leu Gly Glu Ala Ala 
80 

Ser Thr Tyr Thr Gly 
95 



Mtb9.8 ORF peptides 

(2) INFORMATION FOR SEQ ID NO: 110: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 

Met Ser Leu Leu Asp Ala His lie Pro Gin Leu Val Ala Ser Gin 
1 5 10 15 



(2) INFORMATION FOR SEQ ID NO: 111: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

Ala His lie Pro Gin Leu Val Ala Ser Gin Ser Ala Phe Ala Ala 
IS 10 15 



(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE 

Leu Val Ala Ser Gin 
1 5 



DESCRIPTION: SEQ ID 

Ser Ala Phe Ala Ala 
10 



NO: 112 : 

Lys Ala Gly Leu Met 
15 
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(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 15 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113: 

Ser Ala Phe Ala Ala Lys Ala Gly Leu Met Arg His Thr II- Gly 
5 10 IS 

(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114 : 
Lys Ala Gly Leu Met Arg His Thr lie Gly Gin Ala Glu Gin Ala 
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(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115 : 

Arg His Thr lie Gly Gin Ala Glu Gin Ala Ala Met Ser Ala Gin 
5 10 15 

(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116 : 
Gin Ala Glu Gin Ala Ala Met Ser Ala Gin Ala Phe His Gin Gly 
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(2) INFORMATION FOR SEQ ID NO: 117: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRAND ED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117; 

Ala Met Ser Ala Gin Ala Phe His Gin Gly Glu Ser Ser Ala Al 
5 -10 15 

(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 

Ala Phe His Gin Gly Glu Ser Ser Ala Ala Phe Gin Ala Ala His 
5 10 15 

(2) INFORMATION FOR SEQ ID NO: 119: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: IS amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119 : 

Glu ser Ser Ala Ala Phe Gin Ala Ala His Ala Arg Phe Val Ala 
5 10 15 

(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120: 



Phe Gin Ala Ala His Ala Arg Phe Val Ala Ala Alalia Lys Va 

(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 15 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 

Ala Arg Phe Val Ala Ala Ala Ala Lys Val Asn Thr Leu Leu Aso 

5 10 15 

(2) INFORMATION FOR SEQ ID NO: 122: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE : amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:122: 

Ala Ala Ala Lys Val Asn Thr Leu Leu Asp Val Ala Gin Ala Asn 
5 10 15 

(2) INFORMATION FOR SEQ ID NO: 12 3: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123: 

Asn Thr Leu Leu- Asp Val Ala Gin Ala Asn Leu Gly Glu Ala Ala 
5 10 15 

(2) INFORMATION FOR SEQ ID NO: 124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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{xi) SEQUENCE DESCRIPTION : SEQ ID NO: 124: 

Val Ala Gin Ala Asn Leu Gly Glu Ala Ala Gly Thr Tyr Val Ala Al- 

5 10 
Asp Ala 15 



Mtb39A (TbH9) 

(2) INFORMATION FOR SEQ ID NO:10S: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 058 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:10S: 

■ GATCGTACCC GTGCGAGTGC TCGGGCCGTT TGAGGATGGA GTGCACGTfT 

GGCATACCCA GAGATGTTGG CGGCGGCGGC TGACACCCTG ScSS££ JESSS" 
TGTGGCTAGC AATGCCGCTG CGGCGGCCCC GACGACTGGG GtSt^CCC rcrrlr^ 
TGAGGTGTCG GCGCTGACTG CGGCGCACTT CGCCGCACAT clarcr^ CCGCTGCCGA 

gagcgctcgg gctgctgcga ttcatgacca StcgSSS acccSSS Sf TCCGT 

CTCGTATGCG GCCACTGAAG TCGCCAATGC GGCGGCGGCC aStaScS SSSS* 0 
GCACGAGAAA CCACGAGAAA TAGGGACACG TAATGGTGGA TTTCGGGGCG^ TTArrarp 
AGATCAACTC CGCGAGGATG TACGCCGGCC CGGGTTCGGC ScgSSS SSSS ^ 
AGATGTGGGA CAGCGTGGCG AGTGACCTGT TTTCGGCCGC ScgSotS czrrrr^ 

tctggggtct gacggtgggg tcgtggatag gttcgtcggc ggSSSSg SrJ^ TGG 

CCTCGCCGTA TGTGGCGTGG ATGAGCGTCA CCGCGGGGCA gSSIgS UTrrT, ^ 
AGGTCCGGGT TGCTGCGGCG GCCTACGAGA CGGCGTATGG gSSSS rrrrnnnn CC 
TGATCGCCGA GAACCGTGCT GAACTGATGA TTCTGATAGC GaSaSS SSS 
ACACCCCGGC GATCGCGGTC AACGAGGCCG AATACGGcS SJSrtSc 
CCGCGATGTT TGGCTACGCC GCGGCGACGG CGACGGCGAC ££££££ SSSSSS 
AGGAGGCGCC GGAGATGACC AGCGCGGGTG GGCTCCTCGA GCAGGCcSr J™ ^ 
AGGCCTCCGA CACCGCCGCG GCGAACCAGT TGATGAACAA SSS?* 8 
AGCTGGCCCA GCCCACGCAG GGCACCACGC CTTCTTCCAA GCtoStS^ ° 
CGGTCTCGCC GCATCGGTCG CCGATCAGCA ACAtStG^ StgISSc JESS** 
CGATGACCAA CTCGGGTGTG TCGATGACCA ACACCTTGAG SSSSJS IE22J£ 

ctccggcggc ggccgcccag gccgtgcaaa ccgcggcgca aaacgSS ^ C TG 

GCTCGCTGGG CAGCTCGCTG GGTTCTTCGG GTCTGGGCGG TG^GG^Scc 

gtcgggcggc ctcggtcggt tcgttgtcgg tgccgcaggc ctoSSS 

CAGTCACCCC GGCGGCGCGG GCGCTGCCGC TGACCAGCCT ScUgSS SSISS 
GGCCCGGGCA GATGCTGGGC GGGCTGCCGG TGGGGCAGAT GcrnrS^ GCGGAAAGAG 

0(3 tgtgctgcgt ^ccgccgI SSSS £££££ 

SSSSS S?^ CGC AGACTGTCGT ™™*«* gtgaSSg gSSStgI 

TTCCGCGGCC GGCTATGACA ACAGTCAATG TGCATGACAA GTTACAGGTA tti^w™ 
ZSZF™ GAGACAGGCA ACATGGCCTC ACGTTTTATG SgStSg 

£££2?* GGCCGTTrTG aggtgcacgc ccagacggtg gaggacgaS ctcgcJcS? 

GTGGGCGTCC GCGCAAAACA TTTCCGGTGC GGGCTGGAGT GGCATGGCCG AG^SS^ 

gctagacacc atggcccaga tgaatcaggc gtttcgcaac atoskKS ™ TC 

GGTGCGTGAC GGGCTGGTTC GCGACGCCAA CAACTACGAG CAG^GC 

gcagatcctc agcagctaac gtcagccgct gcagcacaat actSJaS AGGC ^ CCCA 
acaggttcga tgaccatcaa ctatcaattc ggggatgtJg accSSSS GCGAAG f AGA 
cgcgctcagg ccgggttgct ggaggccgag catcaggccI JSSStgS CGCC JI GATC 
gcgagtgact tttggggcgg cgccggttcg gcggcctgcc aSSSS SccSg^g 



60 
120 
180 
240 
300 
360- 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
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GGCCGTAACT TCCAGGTGAT CTACGAGCAG GCCAACGCCC ACGGGCAGAA GGTGCAGGCT 2 2 30 

GCCGGCAACA ACATGGCGCA AACCGACAGC GCCGTCGGCT CCAGCTGGGC CTGACACCAG 2340 

GCCAAGGCCA GGGACGTGGT GTACGAGTGA AGTTCCTCGC GTGATCCTTC GGGTGGCAGT 2 4 00 

CTAAGTGGTC AGTGCTGGGG TGTTGGTGGT TTGCTGCTTG GCGGGTTCTT CGGTGCTGGT 2 4 60 

CAGTGCTGCT CGGGCTCGGG TGAGGACCTC GAGGCCCAGG TAGCGCCGTC CTTCGATCCA 2520 

TTCGTCGTGT TGTTCGGCGA GGACGGCTCC GACGAGGCGG ATGATCGAGG CGCGGTCGGG 2 5 30 

GAAGATGCCC ACGACGTCGG TTCGGCGTCG TACCTCTCGG TTGAGGCGTT CCTGGGGGTT 2 64 0 

GTTGGACCAG ATTTGGCGCC AGATCTGCTT GGGGAAGGCG GTGAACGCCA GCAGGTCGGT 2 7 00 

GCGGGCGGTG TCGAGGTGCT CGGCCACCGC GGGGAGTTTG TCGGTCAGAG CGTCGAGTAC 2760 

CCGATCATAT TGGGCAACAA CTGATTCGGC GTCGGGCTGG TCGTAGATGG AGTGCAGCAG 2 8 20 

GGTGCGCACC CACGGCCAGG AGGGCTTCGG GGTGGCTGCC ATCAGATTGG CTGCGTAGTG 2 380 

GGTTCTGCAG CGCTGCCAGG CCGCTGCGGG CAGGGTGGCG CCGATCGCGG CCACCAGGCC 2 94 0 

GGCGTGGGCG TCGCTGGTGA CCAGCGCGAC CCCGGACAGG CCGCGGGCGA CCAGGTCGCG 3 000 

GAAGAACGCC AGCCAGCCGG CCCCGTCCTC GGCGGAGGTG ACCTGGATGC CCAGGATC 3 05 3 



(2) INFORMATION FOR SSQ ID NO: 107: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 91 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 107: 



Met Val Asp Phe Gly Ala Leu Pro Pro Glu lie Asn Ser Ala Arg Met 
1 5 10 15 

Tyr Ala Gly Pro Gly Ser Ala Ser Leu Val Ala Ala Ala Gin Met Trp 
20 25 30 

Asp Ser Val Ala Ser Asp Leu Phe Ser Ala Ala Ser Ala Phe Gin Ser 
35 40 45 

Val Val Trp Gly Leu Thr Val Gly Ser Trp He Gly Ser Ser Ala Gly 
50 55 60 

Leu Met Val Ala Ala Ala Ser Pro Tyr Val Ala Trp Met Ser Val Thr 
65 70 75 80 

Ala Gly Gin Ala Glu Leu Thr Ala Ala Gin Val Arg Val Ala Ala Ala 
85 90 95 

Ala Tyr Glu Thr Ala Tyr Gly Leu Thr Val Pro Pro Pro Val He Ala 
100 105 no 

Glu Asn Arg Ala Glu Leu Met He Leu He Ala Thr Asn Leu Leu Gly 
115 120 125 

Gin Asn Thr Pro Ala He Ala Val Asn Glu Ala Glu Tyr Gly Glu Met 
130 135 140 

Trp Ala Gin Asp Ala Ala Ala Met Phe Gly Tyr Ala Ala Ala Thr Ala 
145 150 155 iso 

Thr Ala Thr Ala Thr Leu Leu Pro Phe Glu Glu Ala Pro Glu Met Thr 
165 170 175 



IS 



Ser Ala Gly Gly Leu Leu Glu Gin Ala Ala Ala Val Glu Glu Ala Ser 

X8S 190 
Asp Thr Ala Ala Ala Asn Gin Leu Met Asn As n Val P ro oia Ala Leu 

200 205 

Gin Gin Leu Ala Gin Pro Thr Gin Gly Thr Thr Pro Ser Ser Lys Leu 

215 220 

Gly Gly Leu Trp Lys Thr Val Ser Pro His Arg Ser P ro lie Ser Asn 

235 240 
Met Val Ser Met Ala Asn Asn His Met Ser Met Thr Asn Ser Gly Val 

245 250 255 

Ser Met Thr Asn Thr Leu Ser Ser Met Leu Lys Gly Phe Ala Pro Ala- 

265 270 

Ala Ala Ala Gin Ala Val Gin Thr Ala Ala Gin Asn Gly Val Axg Ala 
275 290 235 

Met Ser Ser Leu Gly Ser Ser Leu Gly Ser Ser Gly Leu Gly Gly Gly 

295 300 

Val Ala Ala Asn Leu Gly Arg Ala Ala Ser Val Gly Ser Leu Ser Val 

310 -lie 



315 320- 



Pro Gin Ala Trp Ala Ala Ala Asn Gin Ala Val Thr Pro Ala Ala Ara 

325 330 335 

Ala Leu Pro Leu Thr Ser Leu Thr Ser Ala Ala Glu Arg Gly Pro Gly 

350 

Gin Met Leu Gly Gly Leu Pro Val Gly Gin Met Gly Ala Arg Ala Gly 

360 365 
Gly Gly Leu Ser Gly Val Leu Arg Val Pro Pro Arg Pro Tyr Val Met 



380 



Pro His Ser Pro Ala Ala Gly 
385 390 



Mtb32A (TbRa35) 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1872 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GACTACGTTG GTGTAGAAAA ATCCTGCCGC CCGGACCCTT AAGGCTGGGA CAATTTCTGA 
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TAGCTACCCC GACACAGGAG GTTACGGGAT GAGCAATTCG CGCCGCCGCT r*rm~ 
GTCATGGTTG CTGAGCGTGC TGGCTGCCGT CGGGCTGGGC SScSS SSSf" 
GGCGGCCCCG CCGGCCTTGT CGCAGGACCG GTTCGCCGAC SclcSSS S^S?* 
CCCGTCCGCG ATGGTCGCCC AAGTGGCGCC ACAGGTGGTC aIS^cSS 
CTACAACAAC GCCGTGGGCG CCGGGACCGG CATCGTCATC GATCCCAACG ^ 
GACCAACAAC CACGTGATCG CGGGCGCCAC CGACATCAA^ GcJSSJcg 
CCAAACCTAC GGCGTCGATG TGGTCGGGTA TGACCGCACC SSS^ 

gctgcgcggt gccggtggcc tgccgtcggc ggcgatcggt ggcggcctcg rS2 TGCA 

GCCCGTCGTC GCGATGGGCA ACAGCGGTGG GCAGGGCGGA ACgScCgS S^™* 
CAGGGTGGTC GCGCTCGGCC AAACCGTGCA GGCGTCGGAT JcgSScCG S^SST* 

gacattgaac gggttgatcc agttcgatgc cgcaatccag cccggt^aS GA 

CGTCGTCAAC GGCCTAGGAC AGGTGGTCGG TATGAACACG GcSgScG SSSS" 
GCTGTCCCAG GGTGGGCAGG GATTCGCCAT TCCGATCGGG CaSSSS SSJS?^ 
CCAAATCCGA TCGGGTGGGG GGTCACCCAC CGTTCATATC GGG^S^S 
CTTGGGTGTT GTCGACAACA ACGGCAACGG CGCACGAGTC SacSgS S£ ^ 
TCCGGCGGCA AGTCTCGGCA TCTCCACCGG CGACGTgSc SS^S 

gatcaactcg gccaccgcga tggcggacgc gcttaacggg catSS-S SSSSSS 

„ CTCGGTGAAC TGGCAAACCA AGTCGGGCGG CACGCGTACA GGGAACCTGA * " 108 ° 

Q GGGACCCCCG GCCTGATTTG TCGCGGATAC CACCCGCCGG ^^CCGA 1140 

5 CAGCCGTGAT TGCCGCGTGA GCCCCCGAG^ SSSSS SSSSiS G<aTTOaCaC 12 °° 

ffl GCAATGAACG AGGCAGAACA CAGCGTTGAG SScSS SSSSS 

W • £f TC ^GCATCC GGATGCCAAG GACTTCGGCA SSSSSc cSSSSJ 
W GATCCGACCT GGTTTAAGCA CGCCGTCTTC TACGAGGTGC TGGTCcSSc SSSSS 

m GCCAGCGCGG ACGGTTCCGN CGATCTGCGT GGACTCATCG ATCGcSS SJSSSf 

^ TGGCTTGGCA TCGACTGCAT CTGTTGCCGC CGTTCCTACG ACTcScgS ££££22 

^ GGTTACGACA TTCGCGACTT CTACAAGGTG CTGCCCGAAT TCCrrZr-nnl GCGCGACGGC 

GTCGCCCTGG TCGACACCGC TCACCGGCgI SSSSS SSSccS SSST 
AATCACACCT CGGAGTCGCA CCCCTGGTTT CAGGAGTCCC GcSc^ £££££ 
_ TACGGTGACT ATTACGTGTG GAGCGACACC AGCGAG CGCT ACACcSSc 

=" ' G T SccS A S S™* GAACTGGTCA TTCGATCCTG ™- 



(2) INFORMATION FOR SEQ ID NO: 79: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 355 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

Met Ser Asn Ser Arg Arg Arg Ser Leu Arg Trp Ser Trp Leu Leu Ser 

Val Leu Ala Ala Val Gly Leu Gly Leu Ha Thr Ala Pro Ala Gin Ala 

Ala Pro Pro Ala Leu Ser Gin Asp Arg Phe Ala Asp Phe Pro Ala Leu 

Pro Leu Asp Pro Ser Ala Met Val Ala Gin Val Ala Pro Gin Val Val 

Asn He Asn Thr Lys Leu Gly Tyr Asn Asn Ala "l Gly Ala Gly Thr 

Gly He Val lie Asp Pro Asn Gly Val Val lL Thr Asn Asn His Val 

85 90 
He Ala Gly Ala Thr Asp lie Asn Ala Phe Ser Val Gly Ser Gly Gin 

0 105 no 

Thr Tyr Gly Val Asp Val Val Gly Tyr Asp Arg Thr Gin Asp Val Ala 



120 
130 
240 
300 
360 
420 
430 
540 
600 
660 
720 
780 
840 
900 
960 
1020 



1260 
1320 
1330 
1440 
1500 
1560. 
1620. 
168 0^. 
1740- ' 
1800, 
1860 
1872 
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Val Leu 
130 
Gly Gly 
145 

Gly Gin 



Gly Gin 

Leu Asn 

Gly Gly 
210 
Ala Ala 
225 

He Pro 
Gly Gly 
Gly Val 



115 
Gin 

Val 

Gly 

Thr 

Gly 
195 
Pro 

Ser 

He 

Ser 



Leu Arg 

Ala Val 

Gly Thr 
155 
Val Gin 
180 

Leu He 



Gly Ala 
135 
Gly Glu 
150 

Pro Arg 
Ala Ser 
Gin Phe 



Val Val 
Asp Asn 



Gly Ser 
290 
Thr Ala 
305 

Ala Leu 
Thr Lys 
Pro Pro 



Val 
275 
Ala 



Val 

Asn 

Ser 

Ala 
355 



Gly Gin 
245 
Pro Thr 
260 

Asp Asn 
Pro Ala 
Asp Gly 



Asn Gly 
215 
Phe Gin 
230 

Ala Met 

Val His 

Asn Gly 

Ala Ser 
295 
Ala Pro 
310 

His Pro 



120 

Gly Gly 

Pro Val 

Ala Val 

Asp Ser 
185 
Asp Ala 
200 

Leu Gly 



Leu Pro 

Val Ala 
155 
Pro Gly 
170 

Leu Thr 
Ala He 
Gin Val 



Leu Ser 

Ala He 

He Gly 
255 
Asn Gly 
280 : 
Leu Gly 



Gly His 
325 

Gly Gly Thr Arg 
340 



He Asn 

Gly Asp 

Thr Gly 
345 



Gin Gly 
235 
Ala Gly 
250 

Pro Thr 

Ala Arg 

He Ser 

Ser Ala 
315 
Val He 
330 

Asn Val 



- 125 
Ser Ala 
140 

Met Gly 

Arg Val 

Gly Ala 

Gin Pro 
205 
Val Gly 
220 

Gly Gin 



Ala He Gly 

Asn Ser Gly 
ISO 

v al Ala Leu 
175 

Glu Glu Thr 
190 

Gly Asp Ser 
Met Asn Thr 



Gin Xle 

Ala Phe 

Val Gin 
235 
Thr Gly 
300 

Thr Ala 



Ser Val 
Thr Leu 



Gly Phe Ala 
240 

Arg Ser Gly 
255 

Leu Gly Leu 
270 

Arg Val Val 

Asp Val He 

Met Ala Asp 
320 

Asn Trp Gin 

,335 
Ala Glu Gly 
350 



Mtb8.4 (DPV) 

(2) INFORMATION FOR SEQ ID NO: 101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 500 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 
CGTGGCAATG TCGTTGACCG TCGGGGCCGG GGTCGCCTCC GCAGATCrrn 

CATTAACACC ACCTGCAATT ACGGGCAGGT AGTAGCTGCG SSSSS SS^^ " 

GGCTGCCGCA CAGTTCAACG CCTCACCGGT GGCGCAGTCC S^SgS ^ CCGGG 120 

CGCACCGCCA CCTCAGCGCG CTGCCATGGC CGCGCAaSg S^^tS 18 ° 

ACAGTACATC GGCCTTGTCG AGTCGGTTGC CGGCTCCTGC S^JS SSSf** ^ 

GGGCCCCATC CCGCGACCCG GCATCGTCGC CGGGGCTAG^ JSSSSSf 3 °° 

ACGGGCCGCA TCCCGCGACC CGGCATCGTC GCCGGGGCTA G^SSSg SSSSS "° 

CAACGGGCCG CATCTCGTGC CGAATTCCTG CAGCCCGGGG SJcScS SS^fS ^° 

GCCGCCACCG CGGTGGAGCT rCCACTAG TTCTAGAGCG 4 80 

500 



(2) INFORMATION FOR SEQ ID NO: 102: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 95 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 102 : 

Val Ala Met Ser Leu Thr Val Gly Ala Gly Val Ala Ser Ala Asp Pro 

Val Asp Ala Val lie Asn Thr Thr Cys ^sn Tyr Gly Gin Val Hi Ala 

Ala Leu Asn Ala Thr Asp Pro Gly Ala Ala Ala Gin Phe Isn Ala Ser 

35 40 45 

Pro Val Ala Gin Ser Tyr Leu Arg Asn Phe Leu Ala Ala Pro Pro Pro 

50 55 so 

Gin Arg Ala Ala Met Ala Ala Gin Leu Gin Ala Val Pro Gly Ala Ala 



70 75 



Gin Tyr He Gly Leu Val Glu Ser .Val Ala Gly Ser Cys Asn Asn iL 
35 9 o 95 Y 



Mtbll (Tb38-1) 

(2) INFORMATION FOR SEQ ID NO : 4 6 : 

(i) ■ SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 327 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

CGGCACGAGA GACCGATGCC GCTACCCTCG CGCAGGAGGC AGGTAATTTC GAGCGGATCT 
CCGGCGACCT GAAAACCCAG ATCGACCAGG TGGAGTCGAC GGCAGGTTCG TTgSSScc 
AGTGGCGCGG CGCGGCGGGG ACGGCCGCCC AGGCCGCGGT GGTGCGCTTC ^GAaSS 
CCAATAAGCA GAAGCAGGAA CTCGACGAGA TCTCGACGAA TATTCGTCAG GCcScgtJp 

aatactcgag ggccgacgag gagcagcagc aggcgctgtc ctcgcaaatg ggcttctgS 

CCGCTAATAC GAAAAGAAAC GGAGCAA ^ATG GGCTTCTGAC 300 

327 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

Thr Asp Ala Ala Thr Leu Ala Gin Glu Ala Gly Asn Phe Glu Arq n e 

5 10 15 

Ser Gly Asp Leu Lys Thr Gin He Asp Gin Val Glu Ser Thr Ala Gly 



60 
120 
180 
240 
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Ser Leu Gin Gly Gin Trp Arg Gly Ma Ala Gly Thr Ala Ma Gin Ala 

Ala Val val Arg Phe Gin Glu Ma Ala Asn Lys Gin Lys Gin Glu Leu 

53 60 



Asp Glu He ser Thr Asn He Arg Gin Ala Gly Val Gin Tyr Ser Arg 
Ala Asp Glu Glu Gin Gin Gin Ala Leu Ser Ser Gin Met Gly Phe " 
85 90 95 



TbRa3 

(2) INFORMATION FOR SEQ ID NO : 15 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 542 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SSQ ID NO: 15: 



GAATTCGGCA CGAGAGGTGA TCGACATCAT CGGGACCAGC CCCACAT CC T aarimnn 
GGCGGCGGAG GCGGTCCAGC GGGCGCGGGA TAGCGTCGAT GaS^CgS SctSJS 
CATTGAGCAG GACATGGCCG TGGACAGCGC CGGCAAGATC ACCESS TcSSSSa 
AGTGTCGTTC AAGATGAGGC CGGCGCAACC GCGCTAGCAC GGGCcSS GCAtoSoS 
AAATCGCACG GTTTGCGGTT GATTCGTGCG ATTTTGTGTC TGCTOGCcS gSSta^ 
GCGCGGCCCA GGTCCGCGTG CTGCCGTATC CAGGCGTGCA TCGCGaSS GGcSJcSr 
^;°I TAA TG CTTCGCGT CGACCCGAAC TGGGCGATCC GCCgSSgC SSSSS 
CCGTGGCCAG CCCGTCGATG CCCGAGTTGC CCGAGGAAAC GTGCTGCrar ™^ 
AGCGTCCGTA GGCGGCGGTG CTGACCGGCT CTGCCTGCGC £££££ GcSgSSJ 



(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 

Val He Asp He lie Gly Thr Ser Pro Thr Ser Trp Glu Gin Ala Ala 

Ala Glu Ala Val Gin Arg Ala Arg Asp Ser Val Asp Asp lie tea Val 

20 25 30 

Ala Arg Val lie Glu Gin Asp Met Ala Val Asp Ser Ala Gly Lys lie 

Thr Tyr Arg lie Lys Leu Glu Val Ser Phe Lys Met Irg P ro Ala Gin 

Pro Arg 
65 



60 
120 
180 
240 
3 00 
360 
420 
480 
540 
542 



23 



38kD 



(2) INFORMATION FOR SEQ ID NO: 154: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1993 base pairs 
(3) TYPE: nucleic acid* 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1S4; 



60 
120 
130 
240 
300 
360 
420 
430 
540 



TGTTCTTCGA CGGCAGGCTG GTGGAGGAAG GGCCCACCGA ACAGCTGTTC TCCTCGCCr* 
AGCATGCGGA AACCGCCCGA TACGTCGCCG GACTGTCGGG GGACGTCAAG GACGCCAAGr 
GCGG AAATTG AAGAGCACAG AAAGGTATGG CGTGAAAATT CGTTTGCATA CGCTGTrorr 

cSScSS SSSSS TGCTGCTAGC TGTGGCTCGA AACCACcSS 

CGGTTCGCCT GAAACGGGCG CCGGCGCCGG TACTGTCGCG ACTACCCCCG CGTCGTCGrr 
GGTGACGTTG GCGGAGACCG GTAGCACGCT GCTCTACCCG CTGTTCAACC TGTGGGGTCr 
GGCCTTTCAC GAGAGGTATC CGAACGTCAC GATCACCGCT CAGGGCACCG GTTCTGGTrr 
CGGGATCGCG CAGGCCGCCG CCGGGACGGT CAACATTGGG GCCTCCGACG CCTATCTGTC 
m . GGAAGGTGAT ATGGCCGCGC ACAAGGGGCT GATGAACATC GCGCTAGCCA TCTCCGCTCA 

- GCAGGTCAAC TACAACCTGC CCGGAGTGAG CGAGCACCTC AAGCTGAACG GAAAAGTCCT 500 

= .^CAGATCG c££££I 660 

720 
730 
84 0 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
174 0 
1800 
1860 
1920 



CCCCGGCGTG AACCTGCCCG GCACCGCGGT AGTTCCGCTG CACCGCTCCG ACGGGTCCGT 
TGACACCTTC TTGTTCACCC AGTACCTGTC CAAGCAAGAT CCCGAGGGCT GGGG CAAGTC 
GCCCGGCTTC GGCACCACCG TCGACTTCCC GGCGGTGCCG GGTGCGCTGG GTGAGAACGG 
CAACGGCGGC ATGGTGAC CG GTTGCGCCGA GACACCGGGC TGCGTGGCCT ATATCGGCAT 
CAGCTTCCTC GACCAGGCCA GTCAACGGGG ACTCGGCGAG GCCCAACTAG GCAATAGCrr 
TGGCAATTTC TTGTTGCCCG ACGCGCAAAG CATTCAGGCC GCGGCGGCTG GCTTCGCATr 
GAAAACCCCG GCGAACCAGG CGATTTCGAT GATCGACGGG CCCGCCCCGG ACGGCTATPr 
GATCATCAAC TACGAGTACG CCATCGTCAA CAACCGGCAA AAGGACGCCG CCACCGCGCA 
GACCTTGCAG GCATTTCTGC ACTGGGCGAT CACCGACGGC AACAAGGCCT CGTTCCTCGA 
CCAGGTTCAT TTCCAGCCGC TGCCGCCCGC GGTGGTGAAG TTGTCTGACG CGTTGATCGP 
GACGATTTCC AGCTAGCCTC GTTGACCACC ACGCGACAGC AACCTCCGTC GGGCCATCGT 
GCTGCTTTGC GGAGCATGCT GGCCCGTGCC GGTGAAGTCG GCCGCGCTGG CCCGGCCATT 
CGGTGGTTGG GTGGGATAGG TGCGGTGATC CCGCTGCTTG CGCTGGTCTT GGTGCTGCTP 
GTGCTGGTCA TCGAGGCGAT GGGTGCGATC AGGCTCAACG GGTTGCATTT CTTCACCGrr 
ACCGAATGGA ATCCAGGCAA CACCTACGGC GAAACCGTTG TCACCGACGC GTCGCCCATC 
CGGTCGGCGC CTACTACGGG GCGTTGCCGC TGATCGTCGG GACGCTGGCG ACCTCGGCAA 
TCGCCCTGAT CATCGCGGTG CCGGTCTCTG TAGGAGCGGC GCTGGTGATC GTGGAACGGC 
TGCCGAAACG GTTGGCCGAG GCTGTGGGAA TAGTCCTGGA ATTGCTCGCC GGAATrrrrA 
GCGTGGTCGT CGGTTTGTGG GGGGCAATGA CGTTCGGGCC GTTCATCGCT CATCACATOG 
CTCCGGTGAT CGCTCACAAC GCTCCCGATG TGCCGGTGCT GAACTACTTG CGCGGCGACC 
CGGGCAACGG GGAGGGCATG TTGGTGTCCG GTCTGGTGTT GGCGGTGATG GTCGTTCCCA 

SJ?cS^ Sc^^ GACCTGTTCC GGTGTTGCCC CGGGAGGGCG 1980 

1993 



(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: • 

(A) LENGTH:. 3 74 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 5: 



24 



Met Lys lie Arg Leu His Thr Leu Leu Ala Val Leu T*r AT a Ala ?>-o 
1 5 10 15 ' 

Leu Leu Leu Ala Ala Ala Gly Cys Gly Ser Lys Pro Pro Ser Glv s*r 
20 2S 30 7 SSr 

Pro Glu Thr Gly Ala Gly Ala Gly Thr Val Ala Thr Thr Ala Se- 

35 40 45 

Ser Pro Val Thr Leu Ala Glu Thr Gly Ser Thr Leu Leu Tyr Pro Leu 
50 ss 60 

Phe Asn Leu Trp Gly Pro Ala Phe His Glu Arg Tyr Pro Asn Val Thr 
55 70 75 8Q 

He Thr Ala Gin Gly Thr Gly Ser Gly Ala Gly H e Ala Gin Ala Ala 
35 90 9S 

Ala Gly Thr Val Asn He Gly Ala Ser Asp Ala Tyr Leu Ser Glu Gly 

ioo 105 110 



Asp Met Ala Ala His Lys Gly Leu Met Asn He Ala Leu Ala He Ser 
115 120 12 5 

Ala Gin Gin Val Asn Tyr Asn Leu Pro Gly Val Ser Glu His Leu Lvs 
130 135 i4 0 

Leu Asn Gly Lys Val Leu Ala Ala Met Tyr Gin Gly Thr He Lvs Thr 

I 45 150 i=;q 

135 160 



Pro 



Trp Asp Asp Pro Gin He Ala Ala Leu Asn Pro Gly Val Asn Leu 
1S5 170 17S 

Gly Thr Ala Val Val Pro Leu His Arg Ser Asp Gly Ser Gly Asp Thr 
180 18S 190 

Phe Leu Phe Thr Gin Tyr Leu Ser Lys Gin Asp Pro Glu Gly Trp Glv 
19S 200 205 

Lys Ser Pro Gly Phe Gly Thr Thr Val Asp Phe Pro Ala Val Pro Gly 
210 215 220 

Ala Leu Gly Glu Asn Gly Asn Gly Gly Met Val Thr Gly Cys Ala Glu 
225 230 235 240 

Thr Pro Gly Cys Val Ala Tyr He Gly He Ser Phe Leu Asp Gin Ala 
245 250 255 

Ser Gin Arg Gly Leu Gly Glu Ala Gin Leu Gly Asn Ser Ser Gly Asn 
2S0 2SS 270 

Phe Leu Leu Pro Asp Ala Gin Ser He Gin Ala Ala Ala Ala Gly Phe 
275 280 285 

Ala Ser Lys Thr Pro Ala Asn Gin Ala He Ser Met He Asp Gly Pro 
290 29S 3 00 
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Ala Pro Asp Gly Tyr Profile lie Asn Tyr Glu Tyr Ala He Val Asr 
305 310 315 320 * 

Asn Arg Gin Lys Asp Ala Ala Thr Ala Gin Thr Leu Gin Ala Phe Leu 
325 330 335 

His Trp Ala He Thr Asp Gly Asn Lys Ala Ser Phe Leu Asp Gin Vai 
340 345 350 

Kis Phe Gin Pro Leu Pro Pro Ala Val Val Lys Leu Ser Aso Ala Leu 
355 360 355 

He Ala Thr He Ser Ser 
370 



DPEP 

(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 999 fcas~e pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

ATGCATCACC ATCACCATCA CATGCATCAG GTGGACCCCA ACTTGACACG TCGCAAGGGA 60 

CGATTGGCGG CACTGGCTAT CGCGGCGATG GCCAGCGCCA GCCTGGTGAC CGTTGCGGTG 12 0 

CCCGCGACCG CCAACGCCGA TCCGGAGCCA GCGCCCCCGG TACCCACAAC GGCCGCCTCG 180 

CCGCCGTCGA CCGCTGCAGC GCCACCCGCA CCGGCGACAC CTGTTGCCCC CCCACCACCG 240 

GCCGCCGCCA ACACGCCGAA TGCCCAGCCG GGCGATCCCA ACGCAGCACC TCCGCCGGCC 3 00 

GACCCGAACG CACCGCCGCC ACCTGTCATT GCCCCAAACG CACCCCAACC TGTCCGGATC 3 60 

GACAACCCGG TTGGAGGATT CAGCTTCGCG CTGCCTGCTG GCTGGGTGGA GTCTGACGCC 42 0 

GCCCACTTCG ACTACGGTTC AGCACTCCTC AGCAAAACCA CCGGGGACCC GCCATTTCCC 48 0 

GGACAGCCGC CGCCGGTGGC CAATGACACC CGTATCGTGC TCGGCCGGCT AGACCAAAAG 540 

CTTTACGCCA GCGCCGAAGC CACCGACTCC AAGGCCGCGG CCCGGTTGGG CTCGGACATG 600 

GGTGAGTTCT ATATGCCCTA CCCGGGCACC CGGATCAACC AGGAAACCGT CTCGCTCGAC 660 

GCCAACGGGG TGTCTGGAAG CGCGTCGTAT TACGAAGTCA AGTTCAGCGA TCCGAGTAAG 72 0 

CCGAACGGCC AGATCTGGAC GGGCGTAATC GGCTCGCCCG CGGCGAACGC ACCGGACGCC 780 

GGGCCCCCTC AGCGCTGGTT TGTGGTATGG CTCGGGACCG CCAACAACCC GGTGGACAAG 840 

GGCGCGGCCA AGGCGCTGGC CGAATCGATC CGGCCTTTGG TCGCCCCGCC GCCGGCGCCG 900 

GCACCGGCTC CTGCAGAGCC CGCTCCGGCG CCGGCGCCGG CCGGGGAAGT CGCTCCTACC 960 

CCGACGACAC CGACACCGCA GCGGACCTTA CCGGCCTGA 999 



(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 32 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
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Met His His His His His His Met His Gin Val Asp Pro Asn Leu Thr 

1 S 10 1S 

Arg Arg Lys Gly Arg Leu Ala Ala Leu Ala lie Ala Ala Met Ala Ser 

20 25 30 

Ala Ser Leu Val Thr Val Ala Val Pro Ala Thr Ala Asa Ala Asp Pro 

35 40 45 

Glu Pro Ala Pro Pro Val Pro Thr Thr Ala Ala Ser Pro Pro Ser Thr 

50 55 60 - 

Ala Ala Ala Pro Pro Ala Pro Ala Thr Pro Val Ala Pro Pro Pro Pro 
65 70 75 * 80 

Ala Ala Ala Asn Thr Pro Asn Ala Gin Pro Gly Asp Pro Asn Ala Ala 

35 90 95 

Pro Pro Pro Ala Asp Pro Asn Ala Pro Pro Pro Pro Val lie Ala Pro 

100 105 HO 

Asn Ala Pro Gin Pro Val Arg lie Asp Asn Pro Val Gly Gly Phe Ser 

115 120 125 

Phe Ala Leu Pro Ala Gly Trp Val Glu Ser Asp Ala Ala His Phe Asp 
130 135 140 

D Tyr Gly Ser Ala Leu Leu Ser Lys Thr Thr Gly Asp Pro Pro Phe Pro 

U3 145 150 155 160 

IJI Gly Gin Pro Pro Pro Val Ala Asn Asp Thr Arg lie Val Leu Gly Arg 

ffl * 155 170 175 

yj Leu Asp Gin Lys Leu Tyr Ala Ser Ala Glu Ala Thr Asp Ser Lys Ala 

m 180 185 190 

Sj Ala Ala Arg Leu Gly Ser Asp Met Gly Glu Phe Tyr Met Pro Tyr Pro 

fy 195 200 205 

g Gly Thr Arg lie Asn Gin Glu Thr Val Ser Leu Asp Ala Asn Gly Val 

l-e. 210 215 220 

2 . Ser Gly Ser Ala Ser Tyr Tyr Glu Val Lys Phe Ser Asp Pro Ser Lys 

H 225 230 235 240 

=i Pro Asn Gly Gin lie Trp Thr Gly Val lie Gly Ser Pro Ala Ala Asn 

S 245 250 255 

2 Ala Pro Asp Ala Gly Pro Pro Gin Arg Trp Phe Val Val Trp Leu Gly 

260 265 270 

Thr Ala Asn Asn Pro Val Asp Lys Gly Ala Ala Lys Ala Leu Ala Glu 

275 280 285 

Ser lie Arg Pro Leu Val Ala Pro Pro Pro Ala Pro Ala Pro Ala Pro 

290 295 300 

Ala Glu Pro Ala Pro Ala Pro Ala Pro Ala Gly Glu Val Ala Pro Thr 
305 310 315 320 

Pro Thr Thr Pro Thr Pro Gin Arg Thr Leu Pro Ala 
325 330 



TbH4 

(2) INFORMATION FOR SEQ ID NO; 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 702 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
CGGCACGAGG ATCGGTACCC CGCGGCATCG GCAGCTGCCG ATTCGCCGGG TTTCCCCACC 60 
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CGAGGAAAGC CGCTACCAGA TGGCGCTGCC GAAGTAGGGC GATCCGTTCG CGATGCCGGC 
ATGAACGGGC GG CATCAAAT TAGTGCAGGA ACCTTTCAGT TTAGCGACGA TAATGGCTAT 
AGCACTAAGG AGGATGATCC GATATGACGC AGTCGCAGAC CGTGACGGTG GATCAGCAAG 
AGATTTTGAA CAGGGCCAAC GAGGTGGAGG CCCCGATGGC GGACCCACCG ACTGATGTCC 
CCATCACACC GTGCGAACTC ACGGNGGNTA AAAACGCCGC CCAACAGNTG GT>JTTGTCCG 
CCGACAACAT GCGGGAATAC CTGGCGGCCG GTGCCAAAGA GCGGCAGCGT CTGGCGACCT 
CGCTGCGCAA CGCGGCCAAG GNGTATGGCG AGGTTGATGA GGAGGCTGCG ACCGCGCTGG 
ACAACGACGG CGAAGGAACT GTGCAGGCAG AATCGGCCGG GGCCGTCGGA GGGGACAGTT 
CGGCCGAACT AACCGATACG CCGAGGGTGG CCACGGCCGG TGAACCCAAC TTCATGGATC 
TCAAAGAAGC GGCAAGGAAG CTCGAAACGG GCGACCAAGG CGCATCGCTC GCGCACTGNG 
GGGATGGGTG GAACACTTNC ACCCTGACGC TGCAAGGCGA CG 



120 
130 
240 
300 
350 
420 
430 
540 
600 
660 
702 



(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 236 amino acids 

(B) TYPE: amino acid 

_ (C) STRANDEDNESS : single 

2 (D) TOPOLOGY: linear 

^ SEQUENCE DESCRIPTION : SEQ ID NO: 81: 

U Gly Asp Ser Phe Trp Ala Ala Ala Asp Gin Met Ala Arg Gly Phe Val 

K 1 5 10 15 

J? Leu G1 y Ala Thr Ala Gly Arg Thr Thr Leu Thr Gly Glu Gly Leu Gin 

ly 20 25 30 

f His Ala Asp Gly His Ser Leu Leu Leu Asp Ala Thr Asn Pro Ala Val 

N 5 35 40 45 

P * Va l Ala Tyr Asp Pro Ala Phe Ala Tyr Glu lie Gly Tyr He Xaa Glu 

M . 50 55 60 

3 Ser Gly Leu Ala Arg Met Cys Gly Glu Asn Pro Glu Asn He Phe Phe 

□ 65 70 75 80 

3 Tyr He Thr Val Tyr Asn Glu Pro Tyr Val Gin Pro Pro Glu Pro Glu 

85 90 95 

Asn Phe Asp Pro Glu Gly Val Leu Gly Gly He Tyr Arg Tyr His Ala 

100 105 X10 

Ala Thr Glu Gin Arg Thr Asn Lys Xaa Gin He Leu Ala Ser Gly Val 

115 120 12 5 

Ala Met Pro Ala Ala Leu Arg Ala Ala Gin Met Leu Ala Ala Glu Trp 

130 135 140 

Asp Val Ala Ala Asp Val Trp ■ Ser Val Thr Ser Trp Gly Glu Leu Asn 
145 150 155 160 

Arg Asp Gly Val Val He Glu Thr Glu Lys Leu Arg His Pro Asp Arg 

165 170 175 

Pro Ala Gly Val Pro Tyr Val Thr Arg Ala Leu Glu Asn Ala Arg Gly 

180 185 190 

Pro Val He Ala Val Ser Asp Trp Met Arg Ala Val Pro Glu Gin He 

195 200 205 

Arg Pro Trp Val Pro Gly Thr Tyr Leu Thr Leu Gly Thr Asp Gly Phe 

210 215 220 

Gly Phe Ser Asp Thr Arg Pro Ala Gly Arg Arg Tyr Phe Asn Thr Asp 
225 230 235 240 

Ala Glu Ser Gin Val Gly Arg Gly Phe Gly Arg Gly Trp Pro Gly Arg 

245 250 255 

Arg Val Asn He Asp Pro Phe Gly Ala Gly Arg Gly Pro Pro Ala Gin 

260 • 265 270 

Leu Pro Gly Phe Asp Glu Gly Gly Gly Leu Arg ' Pro Xaa Lys 
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275 230 235 



MTbRal2 

(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 447 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

CGGTATGAAC ACGGCCGCGT CCGATAACTT CCAGCTGTCC CAGGGTGGGC AGGGATTCGC 60 

CATTCCGATC GGGCAGGCGA TGGCGATCGC GGGCCAGATC CGATCGGGTG GGGGGTCACC 120 

gg CACCGTTCAT ATCGGGCCTA CCGCCTTCCT CGGCTTGGGT GTTGTCGACA ACAACGGCAA 18 0 

J CGGCGCACGA GTCCAACGCG TGGTCGGGAG CGCTCCGGCG GCAAGTCTCG GCATCTCCAC 24 0 

J CGGCGACGTG ATCACCGCGG TCGACGGCGC TCCGATCAAC TCGGCCACCG CGATGGCGGA 3 00 

g * CGCGCTTAAC GGG CAT CATC CCGGTGACGT CATCTCGGTG AACTGGCAAA CCAAGTCGGG 3 60 

p CGGCACGCGT ACAGGGAACG TGACATTGGC CGAGGGACCC CCGGCCTGAT TTCGTCGYGG 420 

^ ATACCACCCG CCGGCCGGCC AATTGGA 447 

!y (2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 
^ ' (A) LENGTH: 13 2 amino acids 

^ (B) TYPE: amino acid 

^ (C) STRANDEDNESS : single 

3 (D) TOPOLOGY: linear 



(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO 


:66: 








Thr 


Ala 


Ala 


Ser Asp Asn Phe 


Gin 


Leu 


Ser 


Gin 


Gly Gly Gin Gly Phe 


1 






5 






10 




15 




Ala 


He 


Pro 


He Gly Gin Ala 


Met 


Ala 


He 


Ala 


Gly Gin He Arg 


Ser 








20 




25 






30 




Gly 


Gly 


Gly 


Ser Pro Thr Val 


His 


He 


Gly 


Pro 


Thr Ala Phe Leu 


Gly 






35 




40 








45 


Leu 


Gly 


Val 


Val Asp Asn Asn 


Gly 


Asn 


Gly Ala Arg Val Gin Arg 


Val 




50 




55 










60 




Val 


Gly 


Ser 


Ala Pro Ala Ala 


Ser 


Leu 


Gly 


He 


Ser Thr Gly Asp 


Val 


65 






70 








75 


80 


He 


Thr 


Ala 


Val Asp Gly Ala 


Pro 


He 


Asn 


Ser 


Ala Thr Ala Met 


Ala 








85 






90 




95 




Asp 


Ala 


Leu Asn Gly His His 


Pro Gly Asp 


Val 


He Ser Val Asn 


Trp 








100 




105 






110 


Gin 


Thr 


Lys 


Ser Gly Gly Thr 


Arg 


Thr 


Gly 


Asn 


Val Thr Leu Ala 


Glu 






115 




120 








125 




Gly 


Pro 


Pro 


Ala 
















130 



















DPPD 



29 



(2) INFORMATION FOR SEQ ID NO: 2 40: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 9 base pairs 
(3) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240: 

ATGAAGTTGA AGTTTGCTCG CCTGAGTACT GCGATACTGG GTTGTGCAGC GGCGCTTGTG 60 

TTTCCTGCCT CGGTTGCCAG CGCAGATCCA CCTGACCCGC ATCAGCCGGA CATGACGAAA 120 

GGCTATTGCC CGGGTGGCCG ATGGGGTTTT GGCGACTTGG CCGTGTGCGA CGGCGAGAAG 180 

TACCCCGACG GCTCGTTTTG GCACCAGTGG ATGCAAACGT GGTTTACCGG CCCACAGTTT 2 40 

TACTTCGATT GTGTCAGCGG CGGTGAGCCC CTCCCCGGCC CGCCGCCACC GGGTGGTTGC 3 00 

GGTGGGGCAA TTCCGTCCGA GCAGCCCAAC GCTCCCTGA 339 



(2) INFORMATION FOR SEQ ID NO: 241: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 112 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: 



Met 


Lys 


Leu Lys 


Phe 


Ala Arg 


Leu 


Ser 


Thr 


Ala 


He 


Leu 


Gly Cys 


Ala 


1 






5 








10 








15 




Ala 


Ala 


Leu Val 


Phe 


Pro Ala 


Ser 


Val 


Ala 


Ser 


Ala 


Asp 


Pro Pro 


Asp 






20 








25 










30 




Pro 


His 


Gin Pro 


Asp 


Met Thr 


Lys 


Gly 


Tyr 


Cys 


Pro 


Gly Gly Arg Trp 






35 






40 










45 






Gly 


Phe' 


Gly Asp 


Leu 


Ala Val 


Cys 


Asp 


Gly 


Glu 


Lys 


Tyr 


Pro Asp 


Gly 




50 






55 










60 








Ser 


Phe 


Trp His 


Gin 


Trp Met 


Gin 


Thr 


Trp 


Phe 


Thr 


Gly 


Pro Gin 


Phe 


65 








70 








75 








80 


Tyr 


Phe 


Asp Cys 


Val 


Ser Gly Gly Glu 


Pro 


Leu 


Pro 


Gly 


Pro Pro 


Pro 








85 








90 








95 




Pro 


Gly 


Gly Cys 


Gly Gly Ala 


lie 


Pro 


Ser 


Glu 


Gin 


Pro 


Asn Ala 


Pro 






100 








105 










110 
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(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 base pairs 

(B) TYPE: nucleic acid 



30 



* . 4 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SSQ ID NO:103: 

ATGACAGAGC AGCAGTGGAA TTTCGCGGGT ATCGAGGCCG CGGCAAGCGC AATCCAGGGA 60 
AATGTCACGT CCATTCATTC CCTCCTTGAC GAGGGGAAGC AGTCCCTGAC CAAGCTCGCA 120 
GCGGCCTGGG GCGGTAGCGG TTCGGAAGCG TACC 154 



(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 amino acids 

(B) TYPE: amino acid 

(C) STRAND ED NESS : single 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:104: 

Met Thr Glu Gin Gin Trp Asn Phe Ala Gly lie Glu Ala Ala Ala Ser 

15 10 15 

Ala lie Gin Gly Asn Val Thr Ser lie His Ser Leu Leu Asp Glu Gly 

20 25 30 

Lys Gin Ser Leu Thr Lys Leu Ala Ala Ala Trp Gly Gly Ser Gly Ser 

35 40 45 

Glu Ala Tyr 
50 
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